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ABSTRACT 

This is the second of two papers which address the problem of measuring the unred- 
sliifted power spectrum of fluctuations from a galaxy survey in optimal fashion. A key 
quantity is the Fisher matrix, which is the inverse of the covariance matrix of mini¬ 
mum variance estimators of the power spectrum of the survey. It is shown that bases 
of kernels which give rise to complete sets of statistically orthogonal windowed power 
spectra are obtained in general from the eigenfunctions of the Fisher matrix scaled 
by some arbitrary positive definite scaling matrix. Among the many possible bases 
of kernels, there is a basis, obtained by applying an infinitely steep scaling function, 
which leads to kernels which are positive and compact in Fourier space. This basis 
of kernels, along with the associated minimum variance pair weighting derived in the 
previous paper, would appear to offer a solution to the problem of how to measure 
the unredshifted power spectrum optimally. Illustrative kernels are presented for the 
case of the PSCz survey. 
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1 INTRODUCTION 

This is the second of a pair of papers (Hamilton 1997, here¬ 
after Paper 1) concerned with the problem of measuring the 
unredshifted power spectrum of fluctuations in a galaxy sur¬ 
vey in optimal fashion. 

As discussed by Tegmark, Taylor & Heavens (1997), a 
fundamental quantity in the estimation of any set of param¬ 
eters from a data set is the Fisher information matrix, which 
is the matrix of second derivatives of minus the log-likelihood 
function with respect to the parameters, and whose inverse 
gives an estimate of the covariance matrix of the parame¬ 
ters. It was demonstrated in Paper 1 that if the parameter 
set of interest is the unredshifted power spectrum of fluctua¬ 
tions in a galaxy survey, and under the standard assumption 
that the main source of uncertainty in the power spectrum is 
cosmic variance plus Poisson sampling noise, then the Fisher 
matrix can be evaluated rather directly, albeit in an approxi¬ 
mation. With the Fisher matrix in hand, it becomes possible 
to construct complete sets of kernels which yield statistically 
orthogonal sets of windowed power spectra. Such bases of 
kernels hold the potential of providing an optimal way to 
measure the power spectrum, in the sense that they offer 
statistical orthogonality (which implies statistical indepen¬ 
dence in the case of Gaussian fluctuations), and complete¬ 


ness (hence maximum resolution, since the power spectrum 
cannot be decomposed further into statistically orthogonal 
subunits). The purpose of the present paper is to explore 
this idea further, and in particular to show some actual ex¬ 
amples. I choose to use the geometry and selection function 
of PSCz survey (W. Saunders et al., in preparation) for illus¬ 
tration, and I assume Gaussian fluctuations for simplicity. 

As shown in §B|. there are in fact many distinct bases 
of kernels which give rise to statistically orthogonal sets of 
windowed power spectra. After giving details of the compu¬ 
tation of the Fisher matrix in §0, I show two trial bases of 
kernels in ijij. It is found there that, without special atten¬ 
tion, kernels are liable to be broad and wiggly in Fourier 
space, which are not desirable properties. The difficulty is 
resolved in where it is shown how to construct a ba¬ 
sis of kernels which are compact and everywhere positive 
in Fourier space. Section |E] leaves unanswered a number of 
basic questions as to the generality and uniqueness of the 
proposed construction. These questions are posed ‘pj. The 
results are summarized in (>7|. 
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2 STATISTICALLY ORTHOGONAL SETS OF 
POWER SPECTRA 

Let £q, denote the power spectrum £(fc) windowed through 
a set of kernels ip a (k)\ 

/*oo 

'ip a (k)£(k)47rk 2 dk/(2n) 3 . (1) 

Jo 

The power spectrum £(fc) is real, and the kernels ip a (k) can 
also be taken to be real without loss of generality. Let £ a de¬ 
note the minimum variance estimate of the windowed power 
spectrum (as in Paper 1, the hat on £ is used to distin¬ 
guish the estimator from the true power spectrum £) 

£ a = J W(ri,rj)5(ri)5(rj)d 3 nd 3 rj ( 2 ) 

where W(r-i,rj) is the minimum variance pair window de¬ 
rived in Paper 1, equation (39), for the power spectrum win¬ 
dowed through the kernel G(k) = tp a (k). The integration 
in equation (^) is over all pairs of observed overdensities 
5(ri)8(rj ) at positions r; and rj in the survey. The esti¬ 
mators £ a of the windowed power spectra £ a are said to be 
statistically orthogonal if the expectation value of the co- 
variance between different elements vanishes, 

(A&A&) =0 (a / J3) . (3) 


In practical applications, notably when using the basis 
of positive kernels presented in (j^], one might prefer to 
normalize the kernels linearly to unit volume in fc-space, 
J 0 °° , 0a(fc)47rfc 2 dfc/(27r) 3 = 1. The difficulty with the lin¬ 
ear normalization - more a matter of plotting rather than 
anything fundamental - is that for kernels ip a (k) which are 
not everywhere positive in fc-space, such as those considered 
in sjj], a linear rather than quadratic normalization tends 
to make noisy kernels wiggle more fiercely, obscuring the 
better-behaved kernels. 

In a moment I will prove the following theorem. For any 
complete set of kernels ip a (k), along with any set of normal¬ 
ization constants c a , yielding a statistically orthogonal set 
of windowed power spectra there exists a strictly posi¬ 
tive definite - i.e., all the eigenvalues are strictly positive - 
real symmetric scaling matrix 7 (k, k') such that the scaled 
kernels rf) a (k) defined by 

/*oo 

4> a (k) = / q(fc, k')caipa(k') Ank' 2 dk'/(2n) 3 (5) 

Jo 

form a complete orthonormal basis which diagonalizes the 
scaled covariance matrix 7 _ 1 (A£A£) 7 ~ 1 . 

The converse of this theorem is evident. If 7 is any 
strictly positive definite real symmetric matrix, then the 
scaled covariance matrix 7 _ 1 (A£A£) 7 _1 defines a complete 
orthonormal set of eigenfunctions cj > a , with eigenvalues A a : 

^/~\a£a£)^~ 1 (/) a = \ a <f> a ■ ( 6 ) 


Statistical orthogonality does not imply statistical indepen¬ 
dence in general, but it does imply statistical independence 
for the particular case of Gaussian fluctuations, for which by 
definition all the higher order irreducible moments vanish. 

There are not one but many distinct bases, i.e. com¬ 
plete linearly independent sets, of kernels ip a which define 
statistically orthogonal sets of power spectra £ a . To see this, 
consider first that the eigenfunctions of the expected covari¬ 
ance matrix (A£(fc)A£(fc')) of estimators £(k) of the power 
spectrum define a certain complete orthonormal set of ker¬ 
nels ip a (k), for which by construction the windowed power 
spectra £ a are statistically orthogonal. Now rescale each ker¬ 
nel ip a (k) to unit variance, by dividing it by the square root 
of the variance, ipa —> ip a /(A^) 1 ^ 2 . The covariance matrix 
of the power spectrum windowed through the resulting basis 
of scaled kernels is the unit matrix. Any orthogonal rotation 
amongst these scaled kernels yields another basis of kernels, 
which defines another distinct statistically orthogonal set of 
windowed power spectra. 

If il>c,(k) is a set of kernels which defines a statistically 
orthogonal set of estimators £ a of windowed power spec¬ 
tra, then evidently c a ipa(k), where c a is any set of finite 
nonzero normalization constants, defines a renormalized ver¬ 
sion of essentially the same statistically orthogonal set, with 
—> Ca^a and (A£q) —> c 2 a { A£(;). In the development be¬ 
low it is informative to carry along a set of normalization 
constants c a . The normalization constants can be chosen ar¬ 
bitrarily, so that one is free to normalize the kernels ipa(k) in 
whatever manner seems convenient. In the present paper I 
choose to plot the kernels quadratically normalized, so that 
their squares have unit volume in fc-space, 

ip a (k) 2 dnk 2 dk/(2n) 3 = 1 . (4) 



The kernels tp a defined by ij} a = c^ 1 7 _1 </> a , the inverse of 
equation (^), then yield estimators £ a of windowed power 
spectra, equations (]Ij) and (j^), which are statistically or¬ 
thogonal because 

(A£ a A ip) = tpa {A£A£)ipp = c~ 1 rf) a 'y~ 1 (A£A£)'y- 1 (l> 0 c 0 1 

= C a 2 \ a 8Daf3 ( 7 ) 


is a diagonal matrix (no summation over a or /3 implied). 
Here 5oa 0 denotes the unit matrix. In particular, the vari¬ 
ances (A£a) of the windowed power spectra are re¬ 
lated to the eigenvalues A a of the scaled covariance matrix 
q-^A^AOq ^ 1 by 


(A£> = b 

Ca 


( 8 ) 


As seen in Paper 1, what one actually calculates for 
a survey is not the expected covariance matrix (A£A£) of 
estimates of the power spectrum, but rather its inverse the 
Fisher matrix T = (A£A£) _1 . The inverse of the scaled 
covariance matrix 7 _ 1 (A£A£) 7 _1 is the scaled Fisher matrix 
7 T 7 . Thus the eigenfunctions 4>a of the scaled covariance 
matrix can in practice be constructed as the orthonormal 
eigenfunctions of the scaled Fisher matrix, 


^/T'ycpa = H a <t>a , 


( 9 ) 


with eigenvalues fj, a = 1/A a which are the reciprocals of 
the eigenvalues of the scaled covariance matrix. The kernels 
ip a follow as before from ip a = c^ 1 / y~ 1 (/)a, the inverse of 
equation (^]). The variances (A^) of the windowed power 
spectra £ a are related to the eigenvalues of the scaled 
Fisher matrix by, equation (^|), 


<a£> 


1 

Cck/Tq, 


( 10 ) 
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Thus the general procedure for constructing complete 
bases of kernels yielding statistically orthogonal sets of 
power spectra is as follows: Choose any strictly positive def¬ 
inite symmetric scaling matrix 7 ; find the eigenfunctions (j> a 
of the scaled Fisher matrix 7 T 7 ; and construct the kernels 
ip a oc 7 _1 (/> q , normalizing to taste. 

The next few paragraphs prove the previously stated 
theorem, which asserts that a scaling matrix 7 exists for 
any complete set of kernels yielding a statistically orthogo¬ 
nal set of power spectra, and which therefore guarantees that 
the procedure of the previous paragraph is in fact general. 
Suppose then that ip a (k) comprise a complete set of kernels 
which give rise to a statistically orthogonal set of win¬ 
dowed power spectra, equations (jlj) , (Q), and (|3|). The kernels 
ipa(k) themselves need not be orthogonal, but they must be 
linearly independent. For if one of the kernels depended lin¬ 
early on the others, say tp a = 3 for some ap, then 

the corresponding windowed power spectrum would likewise 
depend linearly on the others, = Ylp^a a /3?/3i so that 
(A£ q A £p) = ap{A£p) would be nonzero for some /3, con¬ 
tradicting the statistical orthogonality of the £ a . Linear in¬ 
dependence, coupled with the assumption of completeness, 
means that the kernels ip a (k) form a basis. 

Define the real symmetric matrix A(k, k') by 

A(k,k') = '^ / cl,ip a (k)ip a (k') . (11) 


Being real and symmetric, the matrix A is diagonalized by 
some orthogonal transformation O, so that O t AO is diago¬ 
nal. Rotating equation jll| ) by the orthogonal transforma¬ 
tion O, one concludes that the matrix A must be positive 
definite - that is, all its eigenvalues are positive or zero - 
since each eigenvalue is the inner product of a vector Oci/j 
with itself. The assumption that the kernels 1 p a form a com¬ 
plete set implies further that the eigenvalues of A are all 
strictly positive. For equations (jjj) and (ill]) together imply 



A(k, k')^(k') Ank /2 dk'/(^n) 3 = ^ ' Cgip a (k)^ a , 

OL 


( 12 ) 


and if there existed a nontrivial eigenfunction £ ( k ) of A with 
zero eigenvalue, then the left hand side of equation ( |l2| ) 
would vanish for this eigenfunction. The consequent van¬ 
ishing of the right hand side of equation (|I 0 ) would then 
imply either that the kernels ipa(k) are linearly dependent, 
which is false as shown above, or else that £ a = 0 for all a 
for the eigenfunction £(fc). But if, as postulated, the kernels 
ipa(k) form a complete set, then £( k ) = ^ a * a ^ a (fe) for 
some set of coefficients x a , so that J £(fc) 2 47rfc 2 dfc/(27r) 3 = 
J 2 a Xa J ipa(k)£(k)Ank 2 dk/( 2 n ) 3 = = whose 

only possible solution is £(fc) = 0. Thus the matrix A can 
have no nontrivial eigenfunctions with zero eigenvalue, and 
hence A must be strictly positive definite, as claimed. 

It follows that the matrix A of equation ([II]) can be 
written in the form A = OA 2 O t = (OAO T ) 2 = 7 -2 where 
A is a diagonal matrix, and 7 = OA - 1 O t . Without loss 
of generality, all the square roots in A = (A 2 ) 1 / 2 can be 
taken to be positive. Thus the matrix 7, which I call the 
scaling matrix, is also a strictly positive definite real sym¬ 
metric matrix. It then follows from equation ( |TT| ) that the 
scaled kernels 4> a {k) defined by equation (^) form a complete 
orthonormal set 


^2(j> a {k)cj) a {k') = (2n) 3 5 D (k - k 1 ) (13) 


the Dirac delta-function (2n) 3 8n(k — k') being the unit ma¬ 
trix in the Fourier representation. In accordance with the 
convention of Paper 1, the Dirac delta-function in equa¬ 
tion © is normalized on a 3-dimensional scale, so that 
J 5o{k — fc') 47 rfc' 2 dfc' = 1 . Equation © shows that (j> a {k) 
is in effect an orthogonal matrix (f>k a , whose inverse is its 
transpose. Taking the factors in opposite order yields the 
equivalent orthonormality condition 



(j) a {k)(i)p{k) Ank 2 dk/(2n) 3 = Snap 


(14) 


where Snap is the unit matrix in the ^-representation. 

The desired scaling matrix 7 has thus been constructed, 
as asserted by the theorem, with the property that the scaled 
kernels (f> a defined by equation (|j) form a complete orthonor¬ 
mal set. The final assertion of the theorem, that the scaled 
covariance matrix 7 ^ 1 {A£A£) 7~ 1 is diagonal with respect 
to the scaled kernels </> a , follows directly, equation ( 0 ), from 
the hypothesis that the covariance matrix (A£A£) is diago¬ 
nal with respect to the kernels tp a . This completes the proof 
of the theorem. 

From the definition (0) of and the orthonormality 
condition © on the scaled kernels 0 Q (fc) defined by equa¬ 
tion (| 0 ), it follows that the power spectrum £(fc) can be 
expanded in terms of the kernels ipa(k) as 


£<»=y^£ a c 2 


7 2 (fc, k')ip a (k') An k' 2 dk'/(2 tt) 3 (15) 


[note 7 2 (fc, fc') here is not [ 7 (fc, fc ')] 2 but rather 7 2 (k,k') = 
fo* 3 7(k, k")-f(k", fc')47rfc ,,2 dfc ,, /(27r) 3 ]. The corresponding 
minimum variances estimators £(fc) and are similarly re¬ 
lated, by an expression which is essentially identical to ©■ 
Being real and symmetric, the scaling matrix 7 (fc, fc') 
must be diagonal in some representation or other. All the 
scaling matrices considered in the present paper are diagonal 
in Fourier space, so that the scaling matrix becomes a scaling 
function 7 (fc). In that case, the relation (|j|) between the 
eigenfunctions (j> a {k ) and the kernels 1 p a {k) simplifies to 


<t>a{k) = 7 (fc)c a l/ , tt(fc) , 


(16) 


the orthogonality condition © becomes 



ipa(k)i/jp{k) 7 (fc ) 2 


47 rfc 2 dfc 

(2tt) 3 


0 (a^/3) 

c “ 2 (a = (3) 


(17) 


and the expansion (|I 0 ) of the power spectrum £(fc) in the 
kernels ip a (k) reduces to 


£ 0 ) ='^2^cl^ a (k)'y(k) 2 . 

a 


(18) 


The eigenfunctions (f> a (k ) of the scaled Fisher matrix 7 T 7 
are found by solving the eigenvalue equation (!)j), which re¬ 
duces to 

Ank’dk' 

J 7(fc)r(fc,fc / )7(fc')^(fe / ) ( 27r p = ra<l>a(k) . (19) 

The relation (j^j) between the variances (A£ 2 ) and the eigen¬ 
values pL a of the scaled Fisher matrix remains unchanged. 
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3 COMPUTATION OF THE FISHER MATRIX 

A perturbation procedure for computing the Fisher matrix 
T(k, k') of the unredshifted power spectrum of a galaxy sur¬ 
vey has been described in Paper 1. Here I assume for simplic¬ 
ity Gaussian fluctuations, and the zeroth order approxima¬ 
tion, which is essentially the classical approximation consid¬ 
ered by Feldman, Kaiser & Peacock (1994, hereafter FKP), 
where position and wavelength are simultaneously measur¬ 
able. 

To illustrate the procedures proposed in this paper, I 
take the geometry of the PSCz survey, which is the redshift 
survey of IRAS galaxies complete to 0.6 Jy (Saunders et 
al., in preparation). I take the angular geometry of PSCz 
to be the same as that of the QDOT survey (Lawrence et 
al. 1997), which covers most of the sky above galactic lat¬ 
itude |&| > 10°. I take the radial selection function to be 
6 times that of QDOT, which was designed to be a l-in-6 
random sampling of IRAS galaxies to 0.6 Jy. The QDOT 
radial selection function is measured using Turner’s (1979) 
method, after making cosmological corrections for the vol¬ 
ume element (assuming a flat Universe), and for evolution 
of the luminosity function (adopting luminosity evolution 
L oc (1 + z) 1 ' 8 ). I chose also to exclude the local region 
closer than 25/i _1 Mpc, to eliminate possible bias from the 
local overdensity; in practice this has little effect at the long 
wavelengths illustrated in this paper. 

One test of the accuracy of the zeroth order approx¬ 
imation to the Fisher matrix adopted here is that the 
Fisher matrix should be strictly positive definite, that is, 
all its eigenvalues should be strictly positive. In the PSCz 
survey, I find that if the approximate Fisher matrix is 
computed with a resolution in Fourier space of A k = 
7t/( 896 /i -1 Mpc) = 0.00351 /iMpc -1 , or coarser, then all the 
eigenvalues are positive. At finer resolutions an increasing 
fraction of the eigenvalues are negative. For the trial bases 
exhibited in olj, I chose to over-resolve the kernels, using 
A k = 7r/(2048 h^ 1 Mpc) = 0.00153 /iMpc^ 1 , which appears 
to do no harm to kernels of low variance. At this resolution 
~ 22% of the eigenvalues of the Fisher matrix are negative, 
but the absolute values of the negative eigenvalues are all 
less than 10~ 4 of the largest eigenvalue. 

The zeroth order approximation to the Fisher matrix is 
given in Paper 1, equation (86), as 

T{k,k') = - t f\U(k + k')\ 2 dodo/(4n) 2 (20) 


where do denotes an interval of solid angle and the in¬ 
tegration is over all directions of k and k ', and where 
U(k) = J U(r)e lk r d 3 r is the Fourier transform of the FKP 
window (Paper 1, eq. [72]) 


U(r) 


<F(r) 

l+£(fc 0 Mr) ' 


( 21 ) 


Here < F(r) is the selection function, the expected number 
density of galaxies at position r in the survey, and £(fco) is 
the assumed prior power spectrum at wavenumber feo- The 
higher order terms of the Fisher matrix T(k, k') comprise an 
expansion in £(fc) — £(fco) (Paper 1). Thus the greatest ac¬ 
curacy in the zeroth order term is accomplished by choosing 
fco close to k and k! ; I adopt 


k 0 = (k + k')/2 . 


( 22 ) 


o 

a 

s 

cn 

* 1/1 

c 

<D 
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'3 
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Separation r (h 1 Mpc) 


Figure 1. Expected density R(r) of weighted random pairs at 
separation r, equation (2f|), in the PSCz survey. Curves, from top 
to bottom, are for prior powers £(fco) = I0 2 ■ 1 0 8 . 10 4 , and 


10 5 h 3 Mpc' in the survey window U(r), equation fell). 


The approximation should be quite good, since the Fisher 
matrix (^[]) is almost diagonal in Fourier space, its largest 
elements satisfying k ~ k!. 

The power spectrum £(fco) in the survey window ( |2l| ) 
can be regarded as a prior in Bayesian model testing, or al¬ 
ternatively as a reasonable estimate which may in practice 
lead to kernels yielding nearly statistically orthogonal sets 
of windowed power spectra. I adopt the unredshifted power 
spectrum proposed by Peacock (1997). Peacock gives ana¬ 
lytic formulae for the power spectrum in two cases, a flat 
model with H = 1, and an open model with fl = 0.3; the 
results shown in the present paper are for the flat model. 

It is convenient to evaluate the Fisher matrix © via 
an excursion back into real space. In the real representation, 
the Fisher matrix © is 

T(r,r') = ^8D(r-r')R(r) (23) 

where R(r) is the integral of the pair window U(ri)U(rj) 
over all pairs ij in the survey separated by r 

R{r) = J U(ri)U(rj)SD{\ri - rj\- r)d 3 nd 3 rj . (24) 

Beware of equation (p3|)! It does not signify that the Fisher 
matrix is diagonal in real space, because the prior power 
spectrum £(fco) in the survey window U(r), equation 
is a different constant for different elements of the Fisher 
matrix T{k,k') (the Fisher matrix would be diagonal in 
real space if the prior power spectrum were that of shot 
noise, £(fc) = constant, but that is not the case here). In 
accordance with the convention of Paper 1, the Dirac delta- 
functions in equations ( |23| ) and ([ijJ) are normalized on a 3- 
dimensional scale, so that f SD(r — r')4xr ,2 dr' = 1. The pair 
density R(r), which in the literature is often denoted (RR) 
(for random-random), is the expected number of weighted 
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Figure 2. One-dimensional Fourier transform R(k), equa¬ 
tion (jifil), of the pair density R(r), Figure |l], in the PSCz survey. 
Curves, from top to bottom, are for prior powers £(fco) = 10, 10 2 , 
10 3 , 10 4 , and 10 6 h“ 3 Mpc 3 . 

random pairs of galaxies in the survey per unit volume in¬ 
terval 47rr 2 dr at separation r. Although the pair density is 
commonly computed by Monte Carlo integration, I find it 
faster, more accurate, and more convenient (since the pro¬ 
gram was already written) to compute the integral directly, 
using the procedures described by Hamilton (1993). Figure hi 
illustrates the pair density R(r) in the PSCz survey for var¬ 
ious values of the prior power £(fco) upon which the survey 
window U(r) depends, equation (Ell) . 

Transforming the Fisher matrix ( ]23| ) from real space 
back into Fourier space, T(k,k') = f T(r,r')e lk r+lk r 
d 3 rdV, yields 

T(k,k') = ^[R(k-k')-R(k + k')\ (25) 

where R(k) is the 1-dimensional Fourier transform of the 
pair density R(r) 

/* OO 

R(k) = 2 / R(r) cos(fcr) dr . (26) 

Jo 

Figure ^ illustrates the pair density transform R(k) in the 
PSCz survey. As expected, the pair density transform is 
peaked to small wavenumber k. 

The resolution in fc-space with which the power spec¬ 
trum can be measured is roughly set by the inverse scale of 
the survey, so it makes sense to discretize the Fisher ma¬ 
trix T(fc m ,fc n ) on a uniform grid in k. A uniform grid in k 
is also a natural choice numerically, both in evaluating the 
pair density transform R(k), equation (^), by fast Fourier 
transform, and then in constructing the Fisher matrix ( |25[ ) 
from R(k) evaluated at k = k m dt k n . 

I proceed numerically as follows. Firstly, compute the 
pair density R{r), using the method detailed by Hamilton 
(1993), over a table of separations r and constants £(fco), log¬ 


arithmically spaced at 0.02 dex in r, and 0.1 dex in £(fco). To 
facilitate taking a fast Fourier transform, interpolate on R(r) 
to define a new table linearly spaced typically at intervals 
0.5 ft _1 Mpc in r. Optionally zero-pad this table, typically to 
a separation of 2048 ft _1 Mpc, which results in a grid spacing 
in fc-space of Afc = 7r/(2048/i _1 Mpc) = 0.00153 fcMpc^ 1 . 
Although zero-padding increases the resolution with which 
the Fisher matrix and its eigenfunctions are sampled, it 
does not increase the physical resolution available from the 
survey. For each matrix element T(fc m ,fc„), choose fco = 
(fc m + k n )/ 2, equation (p2|); set £(fco) to be the = 1 power 
spectrum proposed by Peacock (1997); interpolate logarith¬ 
mically on the table of the pair density R(r ) as a function of 
£(fco) to find R(r) for the desired £(fco); fast Fourier (cosine) 
transform R(r) to find R(k), equation (^g[) ; and thence ob¬ 
tain T(km, kn), equation (|2E|). Since fco = (fc m + fc n )/2 is con¬ 
stant along cross-diagonals of the Fisher matrix T(fc m , k n ), 
the fast Fourier transform need be carried out only for each 
cross-diagonal, not for each element of the matrix. 

The above procedure yields the Fisher matrix T(fc m , fc n ) 
on a uniformly spaced grid of k„ = nAfc with n = 
0,1 ,,N. According to §f4 bases of kernels yielding statis¬ 
tically orthogonal sets of windowed power spectra are con¬ 
structed by diagonalizing matrices of the form 7T7, where 
7 is some strictly positive definite symmetric matrix. I11 all 
cases considered in the present paper the scaling matrix 7 
is diagonal in Fourier space, that is, 7(fc) is some strictly 
positive function of fc. The equation to be solved is then 
the eigenvalue equation ©■ I11 discretized form, this eigen¬ 
value equation (|l9|) becomes (there is no implicit summation 
in the following equation, nor elsewhere in this paper) 

N 

’ymFmn^'frLUna = (27) 

71 = 0 

where 7 m = 7(fcm), the matrix F mn is a discrete version of 
the Fisher matrix T 

Fmn = k m T(km, k n )k n 47rAfc/(27r) 3 , (28) 

the eigenvalues fi a are the same as in equation (© , and the 
eigenfunctions u ma = u a {k m ), satisfying the orthonormality 
condition ^ m =o M a(fcm)M/s(fcm) = S a / 3 , are discrete versions 
of the eigenfunctions (j> a {k m ) 

Uma = Uai^km) — fcm^a(^m) ^4"7rAfc/(27r) j . (29) 

It is useful also to define corresponding discrete versions 
Vma = v a {km) of the kernels t/j a (km) 

V ma = V a (k m ) = kmtpa(km) [47TAfc/(27l) 3 ] 1/2 (30) 

which are related to the discrete eigenfunctions u a (km ) by 
the same formula © which relates the kernels ipa(km) 
and the eigenfunctions </> a (km), with the same normaliza¬ 
tion constants c a , 

Ua(km) — , y{km)CaVa{km) • 

If the kernels ip a {k) are normalized on the quadratic con¬ 
vention (;d[), then the discrete kernels v a (km) are similarly 
quadratically normalized, u a (fc m ) 2 = 1. In terms of 

the discrete kernel v a (k m ), the windowed power spectrum 
£ a , equation ([!]), is 
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N 

£a = ^ Va(km)(,{km)km [47rAfc/(27r) 3 ] 1/2 . (32) 

m =0 

Although the indices m and n in the eigenvalue equa¬ 
tion terl) run over values 0 to N, it follows from equa¬ 
tion (|25|), and the periodicity of the discrete cosine trans¬ 
form R{k n ) over the interval n = [— N,N ], that the perime¬ 
ter rows and columns of F are identically zero, F mn = 0 
for m or n = 0 or N. Amongst other things, this guaran¬ 
tees that each eigenfunction u a (km), and consequently each 
kernel u a (fc m ), vanishes at the boundary values m = 0 and 
N. Thus F mn is effectively reduced to a (N — 1) x ( N — 1) 
matrix, with m and n running over values 1 to N — 1. 

Solution of equation (E7f) yields N — 1 orthonormal 
eigenvectors u a (km), and the kernels v a (k m ) then follow 
from equation (|3l|). The eigenfunctions cf> a (km ) and the cor¬ 
responding kernels ip a {km) are given in terms of their dis¬ 
crete counterparts u a {k m ) and v a (km ) by equations ( |29| ) 
and (|30[). The expected variance (A£^) of estimates £ a of 
the power spectrum windowed through the kernel ipa, equa¬ 
tions (|Ij) and (|j), is related to the eigenvalue of -yF 7 in 
equation ( | 2 t| ) by equation (|h]). 

I 11 practice, I do not attempt to solve for the eigen¬ 
functions of the full matrix 7 F 7 , but truncate the matrix 
typically at a half-wavelength of n/(NAk) = 8 /i _ 1 Mpc. At 
a resolution of A k = 7t/( 2048 ft -1 Mpc), this means truncat¬ 
ing the matrix to the first IV — 1 = 255 rows and columns 
(i.e., I only bother to calculate these elements). The cutoff 
at 8 /i -1 Mpc seems reasonable, given that the Fisher matrix 
calculated here assumes Gaussian fluctuations, which may 
be valid on linear scales, down to ~ 8 /i - 1 Mpc, but certainly 
fails below this. 


4 SOME TRIAL KERNELS 


As discussed in §y, there are many distinct bases of kernels 
ipa(k) which yield statistically orthogonal sets of windowed 
power spectra t ; a . Any such basis can be constructed by 
choosing a strictly positive definite scaling matrix 7 , and 
finding the eigenfunctions (j> a = yCa^a. of the scaled Fisher 
matrix 7 T 7 . In all the cases considered here, the scaling 
matrix 7 is taken to be diagonal in Fourier space, so that it 
becomes a scaling function 7 (k). 

I 11 this section I show, for the PSCz geometry, bases 
of kernels for two natural choices of the scaling function, 


7 (k) = 1 in 14.1. and 7 (k) = £(fc) in §4.2 


As will be seen, the resulting kernels are too broad and 
wiggly in fc-space to be of much practical use. The difficulty 
is resolved in the next Section, where a basis of kernels 
which are narrow and positive in Fourier space will be con¬ 
structed. 


4.1 Eigenfunctions of {A£(k)A£(k')) 

A natural choice of the scaling function 7 (k) is a constant 

7 (k) = 1 • (33) 

The kernels ipc,(k) in this case coincide with the scaled ker¬ 
nels (j> a (k ), and are the eigenfunctions of the covariance ma¬ 
trix (A£(fc)A£(fc')). As discussed in §(j, these eigenfunctions 
are in practice constructed as eigenfunctions of the Fisher 



Wavenumber k (h Mpc 1 ) 


Figure 3. A sample of eigenfunctions ip a (k) of the expected 
covariance (A^(k)A^(k')) of minimum variance estimates of the 
power spectrum of the PSCz survey. The eigenfunctions are la¬ 
belled in order of their eigenvalues, the expected variances (A 
the lowest numbered eigenfunction having the smallest variance. 
The grid in k here is N = 256 long, so there are N — 1 = 
255 eigenmodes. The resolution is A k = 7t/( 2048 1 Mpc) = 

0.00153 hMpc -1 . 


matrix T(k,k') = (A^(fe)A£(A/)) _1 . The kernels i/j a (k) here 
form an orthonormal set, a property unique to the case of a 
constant scaling function. 

Figure [i] shows a selection of eigenfunctions ip a (k) of the 
Fisher matrix T(k, k') of the PSCz survey. The Figure shows 
ktl> a (k) rather than il> a (k) in part because the discretized 
version v a (k) of the kernel, equation (|5^), includes the ex¬ 
tra factor of fc, and in part because kiji a {k) gives a better 
impression of the contribution to the windowed power spec¬ 
trum from the power £(fc) at different fc, given that the 
kernel tp a {k) gets multiplied by k 2 ^{k) and that £(fc) 9$ fc _1 
is a reasonable intermediate choice for the slope of the power 
spectrum. 

The kernels shown in Figure [|have the drawbacks that 
for the most part they are neither everywhere positive, nor 
compact in k. A third problem is that when the range in k is 
extended to shorter wavelengths (larger k), the fundamental 
mode at the shortest scales retreats to the smallest available 
scale. These defects are redressed in §|H| 

The kernels shown in Figure exhibit some features 
which are typical also for other choices of the scaling func¬ 
tion 7 (k). The first several (here about 30) modes cover the 
full range of k with relatively broad kernels of relatively low 
variance. Subsequent modes (not shown) retrace the same 
range with ever more oscillatory kernels. The shapes of the 
noisiest kernels resemble violin notes viewed on an oscillo¬ 
scope. Unlike the fundamental at small scales (large k), the 
fundamental at large scales (small k) appears robust against 
changes of resolution in either real or Fourier space, as long 
as the resolution in Fourier space is high enough. 
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Wavenumber k (li VI pc 1 ) 


Figure 4. A sample of kernels !/i a (fc) oc <j> a (k) /£(fc) obtained 
from the eigenfunctions <j> a (k) of the expected relative covari¬ 
ance £(fc) — 1 (A£(fc)A£(fc , ))£(fc , ) —1 of minimum variance estimates 
of the power spectrum of the PSCz survey. The eigenfunctions 
are labelled in order of the eigenvalues, the relative variances 
((A§/£)^), equation (fjj|), the lowest eigenfunction having the 
smallest relative variance. The fundamental mode is centred 
about the peak in the prior power spectrum at k ~ 0.05 hMpc -1 . 


The large scale fundamental, which is almost but not 
quite positive everywhere, happens to have the 18th smallest 
variance (A£„). One should bear in mind that the variance 
depends on the normalization of the kernel, which is taken 
here to be quadratic, equation (^). A linear normalization 
would tend to increase the amplitude and hence variance of 
the wigglier kernels, and would improve the rating of the 
fundamental. However, the ordering of the variances is cor¬ 
rect for the kernels as normalized in Figure [^. 


relative covariance matrix £(fc) -1 (A£(fc) A^(fc'))^(fc , ) _1 , cor¬ 
responding to the case where the scaling function q(fc) is 
chosen equal to the prior power spectrum £( k ) 

7(fc) = £(/c) . (36) 

The eigenvalues X a of the relative covariance matrix 
£(fc) _1 (A£(fc)A£(fc'))£(fc') _1 are equal to the relative vari¬ 
ances ((AOOl) 

\ a = <(A£/0a> (37) 

defined by 

<(A^) 2 a > = —- {A P , -7 • (38) 

Jo [Vk*(fc)£(fc)] 47 r&rdfc/( 27 r ) 3 

The relative variances ((A£/£)„), which can be regarded as 
being defined for arbitrary kernels ip a (k) by equation (^j|), 
have units of inverse volume, and are equal to 2/V in the 
limit of a very large volume, according to equation (^5|). 
It may plausibly be argued that the relative variances 
((A£/£)^) may offer a better measure of noisiness than the 
absolute variances (A^), since equation (|34|) shows that 
the variance of the power spectrum windowed through some 
kernel ip a (k) may be large not because of noise, but sim¬ 
ply because £(fc) itself is large over the range of the kernel 
V'a(fc). 

Figure [ij shows the resulting kernels i/j a (k ) oc 
<j>a{k)/^(k) derived from the eigenfunctions rf) a (k) of the 
relative covariance matrix £(fc) _1 (A£(fc)A^(fc'))^(fc , )“ 1 . As 
usual, the eigenfunctions are in practice found as eigenfunc¬ 
tions of the inverse of this matrix, the scaled Fisher matrix 
£(fc)T(fc, *')£(*')■ 

The kernels are labelled not in order of the absolute 
variances (Af^) as in Figure [sj but rather in order of the 
eigenvalues, the relative variances ((A£/£)^), equation (p8|). 
The single fundamental mode has the smallest relative vari¬ 
ance, and is centred around the maximum at ~ 0.05 /iMpc -1 
in the prior power spectrum. 

The kernels shown in Figure ^ share the same prob¬ 
lem as those in Figure |3| that they are neither compact nor 
everywhere positive in Fourier space. 


4.2 Eigenfunctions of £(fc) 1 {A£(fc)A£(fc , ))£(fc , ) 1 


For a contiguous survey of extremely large (tending to in¬ 
finity) volume V, and for Gaussian fluctuations, the power 
spectrum itself becomes statistically orthogonal, with vari¬ 
ance (Paper 1, eq. [65] with [63]) 

(A£»A|(fc')> = (2 t r) 3 5n(k - k')^P (34) 

where the delta-function (27r ) 3 <5d(/c— A/) is the unit matrix in 
the Fourier representation. It follows from equation © that 
the relative covariance £ _1 (A£A£)£ _1 of the power spectrum 
in an extremely large survey is proportional to the unit ma¬ 
trix 


(At(k)At(k')} 


(2n) 3 S D (k - k')r^ . 


(35) 


This suggests that it might be interesting to consider 
kernels ip a (k) derived from the eigenfunctions 4> a (k) of the 


5 A BASIS OF POSITIVE COMPACT 

KERNELS 

It is evidently desirable that a basis of kernels, besides yield¬ 
ing a statistically orthogonal set of windowed power spectra, 
should be positive and compact in Fourier space. Positivity 
would preserve the intrinsic positivity of the power spec¬ 
trum, while compactness would allow the power spectrum 
to be measured with high resolution in Fourier space. In 
this Section I show how to construct such a basis. A numer¬ 
ically convenient version of the construction is described in 
the paragraph containing equations (|l5|) and (pfi|). 

To understand how to make kernels compact and pos¬ 
itive, consider the mathematical theorem that the eigen- 
modes of a real symmetric matrix are unique up to arbi¬ 
trary orthogonal rotations amongst degenerate eigenmodes, 
that is, amongst eigenmodes with the same eigenvalue. In 
the present case the eigenvalues of the Fisher matrix, even if 
not degenerate, are nevertheless many and finely spaced, and 
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there is much potential for nearly degenerate eigenmodes to 
mix in random unpleasant ways. This suggests that mixing 
can be reduced in Fourier space by lifting the degeneracy of 
eigenvalues in Fourier space, which can be accomplished by 
applying a scaling function y(fc) which is a strongly varying 
function of k. Numerical experiment indicates that the most 
compact and positive kernels arise from choosing the scaling 
function 7 (fc) to be as steep as possible. 

The limiting case of an infinitely steep scaling function 
7 (fc) can be solved analytically, and as will be seen, Figure [sj 
it gives rise to a basis of positive kernels as pretty as one 
could wish for. The solution, given immediately following, is 
simpler than for finite scaling functions 7 (fc), requiring no 
diagonalization at all. 

Suppose that the matrix T(k m ,k n ) is discretized, as is 
the practical case, with m, n = 1,2,..., iV — 1, and sup¬ 
pose that 7 (fci) 3> 7 ^ 2 ) 7 (fcjv-i)- For a uniformly 

spaced grid of k, as here, it is convenient to work with the 
discrete version F mn , equation (]5^), of the Fisher matrix, 
and the associated discrete versions u a {k m ) and v a (k m ) of 
the eigenfunctions <fi a (k m ) and the kernels ip a (k m ), equa¬ 
tions ( pj| ) and (©. The equation to be solved is the discrete 
eigenvalue equation (|27|). 

Since the first column of the scaled Fisher matrix 
'ymF mn 'y n (no implicit summation) is huge compared to 
all other columns, the first eigenfunction ui(fc m ), the one 
with largest eigenvalue, is proportional to this first col¬ 
umn, ui(km) oc 7 m Fmi. It follows that the first kernel 
vi(km) oc ui(km)/jm is proportional to the first column 
of the Fisher matrix, vi (k m ) oc F m 1 . Since the first element 
of the eigenvector u\{k m ) is huge compared to all other ele¬ 
ments, the eigenvector is ui(km) = (1, 0,..., 0) in the limit, 
but the higher elements of the kernel vector vi (k m ) remain 
generally nonzero. 

The remaining eigenvectors u a (km), a = 2 ,N — 1, 
must be orthogonal to the first eigenvector ui(fe m ), which 
means that their first elements must be zero, u a (ki) = 0 
for a > 1. Since the first two columns of the scaled 
Fisher matrix are huge compared to the others, the sec¬ 
ond eigenfunction 112 (fc m ), with the second largest eigen¬ 
value, must be some linear combination of the first two 
columns 'y m F m \ and ') m F m 2 , and it must be proportional 
to that linear combination whose first element vanishes, 
«2 (k m ) OC 7 m(-FiiF’m 2 — Fi 2 -F ml ). The second kernel fol¬ 
lows from V 2 (km) oc u 2 (fc m )/ 7 m oc FuF m 2 - F^Fmi, whose 
first element is zero but whose other elements are finite. 
The second eigenfunction is U 2 (fc m ) = (0,1, 0,..., 0) in the 
limit. Proceeding, one concludes that the ath eigenfunction 
Ua{km) is that linear combination of the first a columns 
of the scaled Fisher matrix which is zero in the first a — 1 
elements. Normalized to u a (k a ) = 1, the eigenfunction is 


Ua(km') — 


l<xF rna 

’yrn.Faa 


(m < a) 


7a 


E F ~ 


1 F n a (m > a) 


(39) 


where Fma denotes the (m, a)th element of the inverse of the 
a x a submatrix consisting of the first a rows and columns 
of F mn . In the limit, the eigenfunction u a (k m ) is 1 in the 
ath place and zero in all others: 


Ua (kn 


1 (TO = a) 

0 (m ^ a) 


(40) 


The associated eigenvalue of the scaled Fisher matrix is 


7a 

Ma - ! 

r OLOL 


(41) 


It follows from equation ( |39| ) that the kernel v a (k„ 
Ua ik m )/7m is given by 


.'-1 F p— 1 

/ J r mn-T not 

77=1 


Vry ( k rrj ) — C- ( , ^ Fm n F ) 


(42) 


where c' a is an arbitrary 

normalization constant. Note v a (km) = 0 for m < a — 1. 
For the quadratic normalization convention adopted in this 
paper, E(fcm) 2 = the normalization constant is 


Ca — 


The usual normaliza- 


SpN-1 F i\2l 1/2 

Z^m=l lEn=l r mri-T na. J 

tion constants Co, which relate cf> a and tp a , equation 
or equivalently u a and v a , equation (^Tj) , are related to the 
normalization constants c' a above by 


Ca — C r>: /7a 


(43) 


The estimators equations (jlj), (|2|), and (|3^), of power 
spectra windowed through the kernels tp a ( k) corresponding 
to v a (k), equation (jllj), are by construction statistically or¬ 
thogonal, in accordance with equation (|tJ). The expected 
variance (A//) is given by formula (|To|), which here reduces 
to 


(ACa) = 


(44) 


miation (^lj), and the 
31), cancelling out. 


the factors of 7 a in the eigenvalue /.i a , eq 
normalization constant c a , equation (M 

In practice, the kernels v a {k m ), equation ([42)) , can be 
evaluated conveniently by the following procedure, which 
reduces the Fisher matrix F mn iteratively to lower triangu¬ 
lar form. Suppose that the discrete Fisher matrix F m „ de¬ 
fined by equation (^s|) has been reduced so far to the point 
where all the above diagonal elements of the first a — 1 rows 
are zero. Call this reduced Fisher matrix which starts 
equal to the Fisher matrix Fmn at step one, a = 1. Then 
the ath kernel v a {k m ) is proportional to the ath column of 
the reduced Fisher matrix 


Va Ikin') — 


F' 


(45) 


where c 1 / is an arbitrary normalization constant. For the 
quadratic normalization convention y) _[ v a (k m ) 2 = 1, the 

normalization constant is c'a = Hna)) ■ The previ¬ 

ous normalization constant c' a in equation (|42j) is related to 
the normalization constant c„ by c' a = d^Faa- The quan¬ 
tity Fa a , which was defined immediately following equa¬ 
tion (|3^), and upon which the eigenvalue /j, a , equation 
also depends, is equal to the reciprocal of the diagonal ele¬ 
ment of the reduced Fisher matrix, F~a = l/T’aa- Thus the 
eigenvalue ([Ilj) is a a = 7^F„a- The variance (A^) of the 
windowed power spectrum £ a is 

<A£> = ■ (46) 

Ca 

Now zero out the ath row of Fm„ above the diagonal 
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Wavenumber k (/?Mpc ') 


Figure 5. Basis of positive kernels obtained by applying an infinitely steep scaling function to the Fisher matrix of the PSCz survey, 
as described in the text, Fj. Counterintuitively, notwithstanding the overlap between kernels, the minimum variance estimators of 
power spectra windowed through these kernels are statistically orthogonal. The resolution is A k = 7r/(896 h ~ 1 Mpc) = 0.00351 fiMpc -1 , 
which is the highest resolution such that all the eigenvalues are positive. The grid in k is N = 112 long, so there are IV — 1 = 111 kernels. 
The crowding of the kernels at the right of the graph is an artefact of truncating the Fisher matrix at Fy = NAk = 7r/(8 h~ 1 Mpc) = 
0.393 hMpc^ 1 , and could easily be avoided by extending the matrix to shorter wavelengths (larger k). 


by subtracting an appropriate multiple of the ath column 
Fma from each higher column, that is, F —> F — 
FliaF' an /F' aoi for n > a + 1. Relabel the resulting matrix 
again F' mn . Repeat the above steps until the matrix is in 
lower triangular form. 

When done, the columns of the lower triangular ma¬ 
trix F^c, will be proportional to the kernels v a (km), equa¬ 
tion (^), while the diagonal elements F' will be propor¬ 
tional to the variances (A£^), equation ([fG|). Note that the 
first a — 1 elements of the ath kernel are zero, v a (km) = 0 
for m < a — 1; in particular, the final kernel is ujv-i(fem) = 
( 0 ,..., 0 , 1 ). 

If the Fisher matrix were strictly positive definite, that 
is, if all its eigenvalues were strictly positive, as should 
be the case theoretically, then all the variances (A£^), 
equation would be guaranteed to be positive. As re¬ 

marked near the beginning of §[i| the eigenvalues of the 
approximate Fisher matrix of the PSCz survey computed 
here are all positive only if the resolution is taken to be 
A k = 7t/( 896 /i -1 Mpc) = 0.00351 hMpc -1 or coarser. This, 
apparently, sets the natural resolution of the PSCz survey. 

Figure ^ shows the resulting basis of kernels for 
the PSCz survey at the ‘natural’ resolution A k = 
7r/(896h -1 Mpc). All of the N — 1 = 111 eigenvalues down 
to the cutoff half-wavelength of n/(NAk) = 8h _1 Mpc are 
positive, and all the 111 kernels are everywhere positive. 
At higher resolutions an increasing fraction of the eigenval¬ 
ues of the Fisher matrix are negative. At a resolution of 
A k = 7t/( 1024 /i -1 Mpc) = 0.00307 hMpc -1 there are 3 neg¬ 
ative of 127 eigenvalues, while at A k = 7r/(1536 h _1 Mpc) = 
0.00204 hMpc -1 there are 27 negative of 191 eigenvalues. 
The kernels corresponding to negative eigenvalues are not 


positive everywhere, and they are not well-behaved. Ker¬ 
nels with positive eigenvalues remain positive and mostly 
well-behaved, even when kernels with negative eigenvalues 
are present, although there is an increasing tendency for 
kernels, especially those adjacent to kernels with negative 
eigenvalues, to become shaky at finer resolutions. 

It is evident that the kernels in Figure |E] overlap sub¬ 
stantially. Yet they yield, by construction, a statistically or¬ 
thogonal set of windowed power spectra. This seems aston¬ 
ishing: how can it be? The answer is that the non-diagonal 
elements of the covariance (A£(fc)A €,{k')) of the power spec¬ 
trum of the survey are not all positive, and the positive and 
negative covariances cancel precisely between overlapping 
kernels. Rather, it is the elements of its inverse, the Fisher 
matrix T(k, k’) = (A£(fc) A^(fc , )) _1 , which are all positive in 
Fourier space, at least in the approximation used here, equa¬ 
tion (^). If the elements of the Fisher matrix are all posi¬ 
tive, then necessarily the elements of its inverse cannot all 
be positive. Numerically, I find that the elements of the (dis¬ 
cretized) covariance (A^(fc)A^(fc')) in the PSCz survey oscil¬ 
late wildly between positive and negative values. This is not 
simply a numerical artefact. In real space, the survey corre¬ 
lation function is a sum £(rij) + of a cosmic 

term £(r»j) and a Poisson sampling term &(ri)~ 1 5D{rij). 
This survey correlation is ill-defined in Fourier space, be¬ 
cause the selection function <I>(ri) vanishes outside the sur¬ 
vey, so that the Poisson sampling term l I>(r;) _1 blows up. It 
is only the inverse of the survey correlation, which leads to 
the Fisher matrix, which is well-behaved in Fourier space. 

The crowding of the kernels to the right of Figure 
is an artefact of truncating the Fisher matrix at a half¬ 
wavelength of n/(NAk) = 8/i -1 Mpc, and could easily be 
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Wavenumber k (h VI pc 1 ) 


Figure 6. Expected relative uncertainties (A^ 2 ) 1 / 2 /^ in the 
power spectra windowed through the kernels (k) of Fig¬ 
ure H plotted against k a = aAk. The solid line shows the ex¬ 
pected uncertainty in the minimum variance estimator, while the 
dashed line shows the expected uncertainty in the zeroth order 
approximation to the minimum variance estimator, as described 
in Paper 1, equations (97) and (100). 


avoided by extending the matrix to shorter wavelengths. I 
show the crowding to illustrate what happens. The kernels 
remain compact, positive, and well-behaved, with no nega¬ 
tive eigenvalues, if the matrix is extended by a factor of 4 to 
N = 448 at the same resolution, corresponding to a mini¬ 
mum half-wavelength of iv/(NAk) = 2 h~ l Mpc. Presumably 
the kernels continue to remain well-behaved beyond this. 

The kernels in Figure are narrowest and highest at 
the peak ~ 0.05 /iMpc -1 in the prior power spectrum. The 
width, hence height, of a kernel in this basis is approximately 
equal to the width of the pair density transform R(k) at the 
corresponding wavenumber k, which in turn is related to 
the prior power spectrum as illustrated in Figure pj. Increas¬ 
ing the prior power spectrum £(fc) would narrow the kernel, 
while reducing the prior power spectrum would broaden it. 
Figure ^ shows that, for the PSCz survey, changing the prior 
power spectrum by a factor of ten would change the width 
of the kernel by a factor of somewhat less than two. If I had 
chosen to use as a prior Peacock’s (1997) power spectrum 
for = 0.3, which is somewhat higher than the adopted 
case $2 = 1, then the kernels in Figure ^ would have been 
somewhat narrower. 

The expected relative uncertainty in the windowed 
power spectrum is given by (A^ 2 ) 1 / 2 /^. Figure shows 
this relative uncertainty for the kernels shown in Figure 
In addition to the uncertainty in the minimum variance es¬ 
timator, Figure ^ also shows the expected uncertainty in 
the zeroth order approximation to the minimum variance 
estimator, equations (97) and (100) of Paper 1. The two 
uncertainties are quite similar, except for the first few ker¬ 
nels at the smallest wavenumbers k, suggesting that the ze¬ 


roth order approximate estimator should be adequate ex¬ 
cept at the longest wavelengths. The sharp rise in the un¬ 
certainty to the right of the graph (and the discrepancy be¬ 
tween the two uncertainties there), is again an avoidable 
artefact of truncating the Fisher matrix at a half-wavelength 
of 7r/(NAk) = 8 /i - 1 Mpc. Roughly speaking, the relative 
variance of a mode a at wavenumber ~ k a is given by 
(A £ 2 }/£ 2 rs 2/(yVk), where V is the effective volume of 
the survey at k a , the volume over which £(fc a )$(r) <; 1 , 
and 14 ~ 4 - 7 rfc 2 Ak a /(2iv) 3 is the coherence volume of the 
mode, Ak a being the width of the mode. Interestingly, the 
selection function of the PSCz survey falls off in such a way 
that the effective volume V initially increases as the wave¬ 
length increases (k decreases) somewhat more rapidly than 
the coherence volume 14 decreases, so causing the relative 
uncertainty shown in Figure ^ to decline mildly with de¬ 
creasing k down to ~ 0.1 /iMpc -1 . 

6 QUESTIONS 

This paper leaves unanswered a number of questions which 
it would be interesting to investigate further. 

(1) Notwithstanding the mathematical argument at the 
beginning of which motivated the idea of applying an 
infinitely steep scaling function to the Fisher matrix, it re¬ 
mains somewhat mysterious as to why the kernels which 
result are indeed all positive. Is positivity true for arbi¬ 
trary survey geometries? Is there some fundamental theorem 
which ensures positivity? 

(2) The kernel set obtained from the infinitely steep 
scaled Fisher matrix ceased to be positive when the resolu¬ 
tion in Fourier space was set too high, and this breakdown 
was signalled by the appearance of negative eigenvalues in 
the discretized Fisher matrix. Presumably the appearance 
of negative eigenvalues is an artefact of the approximation 
to the Fisher matrix used here, since the eigenvalues of the 
true Fisher matrix should all be strictly positive. Neverthe¬ 
less, the numerical results based on the approximate Fisher 
matrix do make physical sense: somehow the numerics know 
that the inverse scale of the survey sets a limit to the reso¬ 
lution at which the power spectrum can be probed. It would 
be nice to have a clearer understanding of what is going on 
here. 

(3) Are there other ways to construct sets of kernels which 
are strictly positive and yield statistically orthogonal sets 
of power spectra, besides applying infinitely steep scaling 
functions? Clearly the kernel set illustrated in Figure ^ is not 
unique, since there are other ways of ordering the elements 
of the scaling function than the ordering y(fci) 7 (fe) 

... S> 7 (/cat-i) adopted there. I have checked for example 
that the inverse ordering 7 (^ 1 ) 4C 7 (^ 2 ) -C ... <C 'f(kN-i) 
also leads to a strictly positive set of kernels for the PSCz 
survey. Whereas the former ordering leads to kernels which 
have tails to large k , the opposite ordering leads to kernels 
with tails to small k. 

I conclude with a caveat about applying the results of 
this paper and Paper 1. The Fisher matrix computed here 
is for the unredshifted power spectrum, whereas the galaxy 
positions in 3-diinensional surveys such as the PSCz survey 
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lie in redshift space, and are subject to distortions from pe¬ 
culiar velocities. Although, as explained in §4 of Paper 1, 
the Fisher matrix of the unredshifted power spectrum is the 
same irrespective of whether the data lie in real or redshift 
space, the minimum variance pair window is not the same 
in both real and redshift space. While there is no harm in 
applying a pair window which is not minimum variance (see 
Paper 1, §7 for further clarification of this point), it would be 
incorrect to carry over the pair window which is minimum 
variance in real space and imagine that it is also minimum 
variance for the power spectrum in redshift space. I hope to 
address the issue of the optimal measurement of the power 
spectrum from data in redshift space in a subsequent paper. 
Such a measurement would include a simultaneous measure¬ 
ment of the linear redshift distortion parameter /3 ss fi°' 6 /fo. 


7 SUMMARY 

This is the second of a pair of papers which address the 
problem of how to estimate the unredshifted power spectrum 
of fluctuations from a galaxy survey in optimal fashion. 

It has been shown that there are many distinct bases 
of kernels ip a (k) which yield statistically orthogonal sets £ a 
of estimates of windowed power spectra. With each distinct 
basis is associated a positive definite scaling matrix 7 . A 
basis of kernels is constructed by finding the eigenfunctions 
(p a oc yip a of the scaled Fisher matrix 7 T 7 . 

Illustrative results have been presented for the case 
of the PSCz survey (Saunders et al., in preparation). The 
Fisher matrix of the unredshifted power spectrum of this 
survey was computed assuming Gaussian fluctuations and 
the zeroth order FKP approximation proposed in Paper 1, 
together with the prior power spectrum proposed by Pea¬ 
cock (1997). Examples of bases of kernels yielding statis¬ 
tically orthogonal sets of windowed power spectra for this 
survey have been shown. 

The main result of the paper is that, among the many 
possible bases of kernels which give rise to statistically or¬ 
thogonal sets of windowed power spectra, there is a partic¬ 
ular basis, obtained by applying an infinitely steep scaling 
function to the Fisher matrix, which leads to kernels which 
are compact and everywhere positive in Fourier space. The 
basis is illustrated for the PSCz survey in Figure Bl This ba¬ 
sis is easily computed from the Fisher matrix, without even 
the necessity to diagonalize. The basis of kernels, along with 
the associated minimum variance pair weighting described 
in Paper 1, §§2.3 and 7, would appear to offer a solution 
to the problem of how to measure the unredshifted power 
spectrum of a galaxy survey in optimal fashion. 
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